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(54) Method and apparatus for runtime merging of hierarchical trees 



(57) Embodiments of the present invention relate to 
a runtime merge system with a reference node imple- 
mentation. According to one or more embodiments of 
the present invention, a reference node is implemented 
which holds a reference to a node in a DOM tree active 
in memory. The reference node class allows adding 
nodes to the merged tree without having to make a clone 
of the node, which is an expensive operation. In one em- 
bodiment, if a particular node is not present below a cer- 
tain level of the tree in any layer except a unique layer, 
it renders visiting the children of that node unnecessary. 
A reference is kept to the node in the memory. 
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Description 

[0001 ] Portions of the disclosure of this patent docu- 
ment contain material that is subject to copyright protec- 
tion. The copyright owner has no objection to the fac- 5 
simile reproduction by anyone of the patent document 
or the patent disclosure as it appears in the Patent and 
Trademark Office file or records, but otherwise reserves 
all copyright rights whatsoever. 

BACKGROUND OF THE INVENTION 

1 . FIELD OF THE INVENTION 

[0002] The present invention relates to a method and 
apparatus of runtime merging of hierarchical trees. 

2. BACKGROUND ART 

[0003] The Internet has made it possible for computer 
users to access to their applications, files, and data from 
any web-enabled device. The applications, files, and da- 
ta are usually stored on a server and are typically organ- 
ized in a hierarchical manner. There are preferences as- 
sociated with each layer in the hierarchy so that each 
user has a unique presentation of the applications, files 
and data. The hierarchy is normally stored using a doc- 
ument object model (DOM) and might access data in an 
extensible markup language (XML) format. 
[0004] When multiple preferences exist in the hierar- 
chy, the system must choose between equivalent nodes 
in multiple trees, so that the preference that takes prec- 
edence is chosen in a final "merge tree". To do so, how- 
ever, is an expensive process as will be further ex- 
plained below. Before discussing this problem, an over- 
view is provided. 

Internet 

[0005] The Internet is a network connecting many 
computer networks and is based on a common address- 
ing system and communications protocol called TCP/IP 
(Transmission Control Protocol/Internet Protocol). From 
its creation it grew rapidly beyond its largely academic 
origin into an increasingly commercial and popular me- 
dium. By the mid-1 990s the Internet connected millions 
of computers throughout the world. Many commercial 
computer network and data services also provided at 
least indirect connection to the Internet. 
[0006] The original uses of the Internet were electron- 
ic mail (e-mail), file transfers (ftp or file transfer protocol), 
bulletin boards and newsgroups, and remote computer 
access (telnet). The World Wide Web (web), which en- 
ables simple and intuitive navigation of Internet sites 
through a graphical interface, expanded dramatically 55 
during the 1 990s to become the most important compo- 
nent of the Internet. The web gives users access to a 
vast array of documents that are connected to each oth- 



er by means of links, which are electronic connections 
that link related pieces of information in order to allow a 
user easy access to them. Hypertext allows the user to 
select a word from text and thereby access other docu- 
ments that contain additional information pertaining to 
that word; hypermedia documents feature links to imag- 
es, sounds, animations, and movies. 
[0007] The web operates within the Internet's basic 
client-server format; Servers are computer programs 
that store and transmit documents (i.e., web pages) to 
other computers on the network when asked to, while 
clients are programs that request documents from a 
server as the user asks for them. Browser software al- 
lows users to view the retrieved documents. A web page 
with its corresponding text and hyperlinks is normally 
written in HTML or XML and is assigned an online ad- 
dress called a Uniform Resource Locator (URL). 

XML DOM 

[0008] XML is emerging as the next generation of 
markup languages. XML DOM details the characteristic 
properties of each element of a web page, thereby de- 
tailing how one might manipulate these components 
and, in turn, manipulate the page. Each component is 
stored in memory. Components include for instance, ob- 
jects, properties, methods, and events. An object is a 
container which reflects a particular element of a page. 
Objects contain the various characteristics which apply 
to that element (known as properties and methods). For 
example, the submit object contains properties and 
methods relevant to the submit button in a form. 
[0009] Properties are characteristics of an object; for 
example, the document object possesses a bgColor 
property which reflects the background color of the 
page. Using a programming language (e.g. , JavaScript) 
one may, via this property, read or modify the color of 
the current page. Some objects contain very many prop- 
erties, some contain very few. Some properties are 
read-only while others can be modified, possibly result- 
ing in immediate on-screen results. 
[0010] A method typically executes an action which 
somehow acts upon the object by which it is owned. 
Sometimes the method also returns a result value. 
Methods are triggered by the programming language 
being used, such as JavaScript. For example, the win- 
dow object possesses a method named alert( ). When 
supplied with string data, the alert( ) method causes a 
window to pop up on the screen containing the data as 
its message; (e.g., alertf Invalid entry!")). 
[0011] An event is used to trap actions related to its 
owning object Typically, these actions are caused by 
the user. For example, when the user clicks on a submit 
button, this is a click event which occurs at the submit 
object. By virtue of submitting a f orm, a submit event is 
also generated, following the click event. Although these 
events occur transparently, one can choose to intercept 
them and trigger specified program code to execute. 
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Preferences 

[0012] Using a web-enabled device to access data, 
files, and applications over the Internet significantly re- 
duces issues associated with installation, configuration, 
maintenance, upgrades for end users and information 
technology (FT) departments. Furthermore, ft eliminates 
license fees and lowers the total cost of ownership for 
enterprise and service providers. Enterprises are gen- 
erally hierarchical in nature. Application and user pref- 
erences stored for each user running desktop applica- 
tions are mostly collected from more than one layer. 
[0013] For example, an organization can have users 
belonging to a group. Any preference data that is absent 
in the user layer can be picked up from the group layer. 
Groups can have system-wide global administrators 
that can dictate enterprise level policies. Layering data 
in IT organization provides a hierarchical structure to da- 
ta and decreases redundancy in data storage. Imagine 
a specific preference data for 1 00 users belonging to the 
same group being the same for all users (i.e., depart- 
ment name); the deployment environment is left with two 
choices; one, replicate the same information for all us- 
ers, or two, keep the specific preference data in the 
group layer and merge the information at runtime with 
the user's preference data during the extraction of the 
information from a backend data store. 
[001 4] In the prior art, the second alternative is picked 
more frequently than the first one. For systems where 
the backend data store for configuration data is XML 
stored in flat files, this data is fetched through a config- 
uration server which helps the user to read, edit and de- 
lete application data through a pure application program 
interface (API) that is platform independent, such as the 
Java API. A call to read preference data is translated to 
the correct set of XML files from the required layers and 
the data is merged at runtime to create the resultant data 
for the end client application. Often, the task of merging 
is left to the configuration server that uses pure DOM 
API to read XML from all the required layers and merge 
them into a temporary DOM tree that is written back to 
a socket stream. 

Problems Associated with Merging DOM Trees On 
The Fiy 

[0015] Though the DOM API is simple and easy to 
use, it comes with the price of memory allocations that 
are proportionate to the size of data (e.g., XML data) 
and the depth of nesting. In general, tree traversals are 
expensive and it is wise to complete the decision of 
choosing the right information while traversing the tree 
for the first time, the difficulty in traversing unbalanced 
trees is due to the fact that leaf nodes present at a par- 
ticular depth in one tree may not be present in the other 
trees that need to be traversed. XML DOM exacerbates 
the problem by not allowing data in a DOM tree to be 
manipulated without copying the node. Merging data 



from various layers involves two major traversals: 

1. The first traversal should traverse all the XML 
trees from the various layers to find the winning 

s nodes (nodes that should be present in the merged 
tree). This can be done by copying the data to a 
temporary tree through the use of the cloneNodeO 
method from the DOM API. 

2. Writing the temporary tree to the output stream 
io used by a TCP socket that transports the data to 

the client. 

[001 6] Unfortunately, cloneNode() is the most expen- 
sive call in an XML parser. It has a list of drawbacks that 
'5 makes it very unpopular to XML users: 

1 . The call is recursive and so ft takes a lot of stack 
space depending on the depth and size of the XML 
node that is cloned. It also allocates internal data 

20 structures that help build the cloned node that are 
not freed until the call is complete. 

2. Cloning large trees can prove to be very expen- 
sive specifically when it is used frequently in a mul- 

25 ti threaded environment. 

3. Cloning creates a second copy of the data. If the 
data is merely meant for reading, creating a copy of 
the same data does not provide any advantage. In 

30 fact, it is disadvantageous because it takes more 
memory and CPU cycles from the machine. 

4. The call to cloneNode() is typically used often. 

35 SUMMARY OF THE INVENTION 

[0017] It is an object of the invention to provide for 
merging of hierarchical trees at reduced complexity. 
[0018] According to an embodiment of the invention, 

40 a method for merging one or more hierarchical trees at 
runtime may comprise: examining one or more nodes in 
each of said hierarchical trees; determining if there are 
equivalent nodes in said hierarchical trees; picking one 
or more winning nodes from each of said equivalent 

45 nodes; and storing one or more reference nodes to said 
winning nodes. 

[001 9] Advantageously the trees are document object 
model (DOM) trees. 

[0020] Further a merged tree may be printed using 
so said reference nodes. 

[0021] Further, a program may have instructions 
adapted to make a computer carry out the above oper- 
ations and a computer readable medium may include 
the program. 

55 [0022] According to another embodiment, a data 
processing unit for merging one or more hierarchical 
trees at runtime may comprise: means for examining 
one or more nodes in each of said hierarchical trees; 
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means for determining if there are equivalent nodes in 
said hierarchical trees; means for picking one or more 
winning nodes from each of said equivalent nodes; and 
means for storing one or more reference nodes to said 
winning nodes. 

[0023] According to another embodiment, a computer 
program product may comprise: a computer usable me- 
dium having computer readable program code embod- 
ied therein configured to merge one or more hierarchical 
trees at runtime comprising: computer readable code 
configured to cause a computer to examine one or more 
nodes in each of said hierarchical trees; computer read- 
able code configured to cause a computer to determine 
if there are equivalent nodes in said hierarchical trees; 
computer readable code configured to cause a compu- 
ter to pick one or more winning nodes from each of said 
equivalent nodes; and computer readable code config- 
ured to cause a computerto store one or more reference 
nodes to said winning nodes. 

[0024] Further advantageous embodiments of the in- 
vention are disclosed in further claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0025] These and otherfeatures, aspects and advan- 
tages of the present invention will become better under- 
stood with regard to the following description, appended 
claims and accompanying drawings where: 

Fig. 1 shows how merge trees are created by one 
embodiment of the present invention. 

Fig. 2 is a diagram showing the use of a reference 
node according to an embodiment of the 
present invention. 

Fig. 3 is a flowchart showing the use of a reference 
node according to an embodiment of the 
present invention. 

Fig. 4 is a flowchart showing the use of an XML 
merge system according to an embodiment of 
the present invention. 

Fig. 5 is a diagram of three example trees in memory. 

Fig. 6 shows a merged tree from three example trees 
according to an embodiment of the present in- 
vention. 

Fig. 7 shows an embodiment of a computer execu- 
tion environment. 

DETAILED DESCRIPTION OF THE INVENTION 

[0026] Embodiments of the present invention relate to 
a runtime merge system with a reference node imple- 
mentation. According to one or more embodiments of 



the present invention, a reference node is implemented 
which holds a reference to a node in a DOM tree active 
in memory. A reference node class is implemented in 
one embodiment, which allows adding nodes to the 

5 merged tree without having to make a clone of the node, 
which is an expensive operation. 
[0027] In one embodiment, if a particular node is not 
present below a certain level of the tree in any layer ex- 
cept a unique layer, it renders visiting the children of that 

10 node unnecessary. In a typical scenario, the user tree 
may not have any information about a component (Of- 
fice, for instance). Without a reference node, one had to 
visit every node in the "Office" component to make a 
copy of the data. With the use of reference nodes, the 

15 invention neither traverses the children of such a node, 
nor copies any of its contents to a new node. A reference 
is simply kept to the node in the memory and later this 
reference is used to print the data to a stream. 
[0028] The present invention relates to a method and 

20 apparatus for runtime merging of hierarchical trees. In 
the following description, numerous specffic details are 
set forth to provide a more thorough description of em- 
bodiments of the invention. It will be apparent, however, 
to one skilled in the art, that the invention may be prac- 

25 ticed without these specific details. In other instances, 
well known features have not been described in detail 
so as not to obscure the invention. 

Runtime Merge System 

30 

[0029] In one embodiment, a merge system imple- 
mented which traverses DOM trees only once to decide 
on the winning nodes for the merged XML data (i.e., 
which preference takes precedence). Moreover, the 

35 system eliminates the use of cloneNode() from the 
whole infrastructure by defining a new concept of "Ref- 
erence node." A reference node is a node that keeps a 
reference or pointer to the location of the XML node rath- 
er than a copy of the node. The result is a very fast and 

40 efficient merging system with minimal allocation of 
memory. 

[0030] A merged tree refers to the merging of layers 
of XML node data, where the node chosen to be includ- 
ed in the merged tree is determined by the highest pri- 

45 ority layer. For instance, assume that the merged data 
for the user preference came from three layers of data: 
System, Group, and User; in the order of increasing pri- 
ority. The User layer has the highest priority, followed by 
Group and then System. Equivalent nodes in each tree 

so are compared. The node in the layer of highest priority, 
among the layers represented, is added to the merged 
tree. Once all the nodes have been compared, the de- 
cision to add which node in the merged tree (which we 
refer to as the winning node) is complete. 

55 [0031] Fig. 1 shows one embodiment of this process. 
At operation 100, multiple hierarchical layers are ob- 
tained (i.e., user, group, and system). At operation 110, 
priorities for each layer are obtained. At operation 120, 
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equivalent nodes for each tree are compared. At oper- 
ation 130, the equivalent node with the highest priority 
is selected to be added to the merged tree. At operation 
140, it is determined if there are any more equivalent 
nodes. If not, the process is complete and the merged 
tree is created at operation 150. Otherwise, the process 
repeats at operation 120. 

Reference Node Implementation 

[0032] The reference node holds a reference to a 
node in a XML DOM tree active in memory. A reference 
node class allows the adding of nodes to the merged 
tree without having to make a clone of the node. This 
class helps us to eliminate cloning nodes, which is an 
expensive operation. The concept of reference node is 
simple yet very powerful. It not only saves copying data 
but also avoids visiting every child of a node while build- 
ing the merged tree. 

[0033] If a particular node is not present below a cer- 
tain level of the tree in any layer except a unique layer, 
it renders visiting the chQdren of that node unnecessary. 
In a typical scenario, the user tree may not have any 
information about a component (Office, for instance). 
Before the introduction of the concept of reference node, 
one had to visit every node in the "Office" component to 
make a copy of the data With the use of Reference 
nodes, the invention neither traverses the children of 
such a node, nor copies any of its contents to a new 
node. A reference to the node is simply kept in the mem- 
ory. 

[0034] Fig. 2 shows the use of a reference node ac- 
cording to one embodiment of the present invention. 
Tree 200 is a hierarchically organized tree, such as a 
DOM tree. Node A has two children B and C. Reference 
node A— is used only as a reference to the actual node 
A (and its children B and C) in memory. Reference node 
A— is used to indicate that node A is a winning node 
when a merged tree is created without having to clone 
tree 200 and paying the associated cost in time and 
memory consumption. 

[0035] Fig. 3 is a flowchart showing how reference 
nodes are used according to an embodiment of the 
present invention. At operation 300 multiple hierarchical 
trees are obtained. At operation 310 all equivalent 
nodes in the trees are obtained and winning nodes are 
selected at operation 330. At operation 340 a reference 
node is placed in memory for each winning node where- 
in the reference node refers to the data node. 

Embodiment of an XML Merging System 

[0036] In one embodiment, the merging system is 
configured to operate on XML data. This embodiment is 
shown in Figure 4. At operation 400, the merged tree is 
first built from an empty XML document element. At op- 
eration 410, all of the layers of the tree are traversed 
through a recursive call. At operation 420, the nodes in 



each layer are compared to the equivalent nodes in the 
previous layers. In one embodiment, the layers are tra- 
versed starting with the highest priority layer down. 
Within each layer, nodes are visited a descendant level 
5 at a time. 

[0037] At operation 430, it is determined if a node is 
present only in one layer. If so, this node is selected as 
the winning node at operation 440 and a reference node 
is created for that particular node and added to the tem- 

10 porary XML document, which represents the merged 
tree at operation 450. If, on the other hand, at operation 
430, a node is present in at least two layers, the winning 
node is selected as the node in the highest priority layer 
at operation 460. 

15 [0038] A shallow clone (a copy of just the root node 
of the subtree and its attributes, but not including any 
children) of the node in the highest priority layer, is made 
and added to the temporary XML document at operation 
470. Shallow clones, as opposed to full clone nodes, are 

20 relatively inexpensive and reduce the amount of copying 
needed. At this point, only a shallow done is part of the 
winning node because the children may still be present 
in more than one layer, so at operation 480, it is deter- 
mined if there are children for the current node. If so, 

25 the process repeats at step 430. 

[0039] If there are no more children at step 480, then 
the layer is completely traversed, and at step 490 it is 
determined if there are unique nodes in the lower layers. 
If so, step 430 repeats. Otherwise, the traversal of all 

30 the trees is completed. At this point, the temporary XML 
document contains a list of nodes and reference nodes, 
which comprise the merged tree. The tree is now out- 
putted to a stream (StringBuffer) at step 495. At this 
point, when the output method identifies a reference 

35 node, the actual contents of the node (with all its chil- 
dren) are retrieved from the reference and outputted to 
the stream buffer. This ensures the contents of such a 
node are visited once and only once. 

40 Case Study 

[0040] Imagine an enterprise where desktop applica- 
tions are accessed via a computer network. For simplic- 
ity, the users in the organization have their data divided 

45 into three layers; namely a user layer, a group layer and 
an administrative layer. The group layer defines the set 
of data that is common to all users in the same group 
(for example, an engineering group or a human resourc- 
es group). Administrators keep data that are common 

so to the organization or all the groups in the organization. 
If the user of a group would like to fetch the preference 
data that pertains to any office application, the informa- 
tion is distributed in three layers, each represented as 
a DOM tree in memory. 

55 [0041] Nodes (carrying preference data) in the user 
layer have precedence over the group layer and the 
group layer has precedence over the administrative lay- 
er. So if the same node is present in the user and group 
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layer, the winning node is picked from the user layer, if 
the node is present n the group and the admin, it is 
picked from the group layer, etc. 
[0042] Though an organization can have as many lay- 
ers as possible, the following seven cases uniquely 
identifies the conditional blocks in one embodiment of 
an iterative merge system working on three layers (User, 
Group, Administrative) 

Case 1 : XML node present in all the three layers (A, 
G,U) 

Case 2: XML node present in User and Group (U, 
G) and not present in admin layer. 

Case 3: XML node present in Group and Admin (G, 
A) and not present in User layer. 

Case 4: XML node present in User and Admin (U, 
A) and not present in group layer. 

Case 5: XML node present in User (U) only and not 
present in admin and group layer. 

Case 6: XML node present in Admin (A) only and 
not present in user and group layer. 

Case 7: XML node present in Group (G) only and 
not present in admin and user layer. 

[0043] Fig. 5 shows the three trees in memory. Mem- 
ory 500 contains group tree 510, user tree 520, and ad- 
min tree 530. Each tree is hierarchical and contains sev- 
eral hierarchical nodes. Group tree 51 0 includes a root 
540 called office and children C and B. Node B is a par- 
ent of child N. Admin tree 530 includes a root 550, which 
has children D and C. Node C is a parent of node M. 
Node D is a parent of node E, which is a parent of nodes 
F and G. User tree 520 includes a root 560, termed of- 
fice, and children A and B. Node A has children L and O. 

Steps In Executing the Runtime Merge System 

[0044] To achieve a merged tree from the trees in Fig. 
5, one embodiment of the present invention operates as 
follows. 

[0045] Iteration 1 : An XML Document element is cre- 
ated with no leaf nodes attached to it. This temporary 
document is used to create the merged tree. 
[0046] Iteration 2: Node "A" is not present in any pre- 
ceding layers. So a reference node from "A" is created 
(a node that holds reference to the actual data) and add- 
ed to the merged tree under "Office'. 
[0047] Iteration 3: Node "B" is present in both group 
and user and so a shallow copy of just "B" is made from 
the user layer and added to the merged tree under "Of- 
fice". 

[0048] Iteration 4: Node "C" is present in both group 



and admin, so a shallow copy of just "C" is made from 
the group layer and added to the merged tree under "Of- 
fice". 

[0049] Iteration 5: Node "D" is only present in admin 

s and so a reference for this node is created and added 
to the merged tree under "Office". 
[0050] Iteration 6: The child under °B n is present only 
in the group layer, so a reference node is made of the 
leaf node, "N", in the group layer and added to the 

10 merged tree under "Office/B". 

[0051 ] Iteration 7: The child under "C" is present only 
in the admin layer, so a reference is made of the leaf 
node, "M", from the admin layer and added to the 
merged tree under "Offlce/C". 

is [0052] Iteration 8. Now that the merged tree is creat- 
ed, a recursive print function iterates through the whole 
merged tree, built out of references, and prints the 
nodes to a stream. The onfy difference with this print 
function and a standard print function in the parser is 

20 that it understands the concept of reference nodes. 
Once it hits a reference node, it knows how to extract 
the actual element node in the reference and call its 
"print 0 function. 

[0053] In one embodiment, the whole mechanism is 
25 based on object oriented programming and so each type 
of node in the tree will do its work to print itself to the 
stream. Figure 6 shows the merged tree and what nodes 
in the merged tree refer to the actual nodes in the DOM 
tree. Figure 6 comes as a result of iterations 1 through 
30 8 above. It includes a root 600, reference node A 605 
which refers to node A 610; reference node B 615; ref- 
erence node N 620 which refers to data node N 625; 
reference node C 630; reference node M 635 which re- 
fers to data node M 640; and reference node D 645 
35 which refers to data node D 650. 

[0054] The system of various embodiments of the 
present invention eliminates the use of cloning nodes 
and introduces a new concept of reference nodes that 
helps documents (e.g., XML documents) to be merged 
40 at runtime. The data structures required for the tempo- 
rary merged tree are minimized. The invention saves 
copying of the same data in multiple places. 
[0055] The new runtime merge system increases the 
speed of merging XML data at runtime. In one embodi- 
es ment, speed is increased by approximately five times 
over prior art methods, The invention also reduces a bot- 
tleneck of performance (the use of cloneNode to repli- 
cate information for XML nodes), improves the response 
time for clients to fetch preference data, merges data 
50 effectively, introduces the use of reference nodes in 
XML to eliminate the copying of data, and considerably 
reduces the size of the intermediate data structures that 
need to be allocated for merging XML documents. In 
one embodiment, the invention is used to merge data of 
55 approximately 30 Kb in Admin tree, 5 Kb in Group tree, 
and 3 Kb in user tree in approximately 25 to 30 millisec- 
onds. 

[0056] According to another embodiment, a data 
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processing unit may be provided for merging one or 
more hierarchical trees at runtime, the data processing 
unit comprising: means for examining one or more 
nodes in each of said hierarchical trees; means for de- 
termining if there are equivalent nodes in said hierarchi- 
cal trees; means for picking one or more winning nodes 
from each of said equivalent nodes; and means for stor- 
ing one or more reference nodes to said winning nodes. 
[0057] The trees may be document object model 
(DOM) trees, and a merged tree using said reference 
nodes may be printed. 

[0058] The data processing unit may include a server 
and a client, the server and client being adapted to 
merge one or more hierarchical trees at runtime. 
[0059] It is noted that a computer-readable medium 
may be provided having a program embodied thereon, 
where the program is to make a computer or a system 
of data processing devices to execute functions or op- 
erations of the features and elements of the above de- 
scribed examples. A computer-readable medium can be 
a magnetic or optical or other tangible medium on which 
a program is recorded, but can also be a signal, e.g. 
analog or digital, electronic, magnetic or optical, in which 
the program is embodied for transmission. Further, a 
computer program product may be provided comprising 
the computer-readable medium. 

Embodiment of Computer Execution Environment 
(Hardware) 

[0060] An embodiment of the invention can be imple- 
mented as computer software in the form of computer 
readable program code executed in a general purpose 
computing environment such as environment 700 illus- 
trated in Figure 7, or in the form of bytecode class files 
executable within a Java™ run time environment run- 
ning in such an environment, or in the form of bytecodes 
running on a processor (or devices enabled to process 
bytecodes) existing in a distributed environment (e.g., 
one or more processors on a network). A keyboard 710 
and mouse 711 are coupled to a system bus 718. The 
keyboard and mouse are for introducing user input to 
the computer system and communicating that user input 
to central processing unit (CPU) 713. Other suitable in- 
put devices may be used in addition to, or in place of, 
the mouse 711 and keyboard 71 0. I/O (input/output) unit 
71 9 coupled to bi-directional system bus 71 8 represents 
such I/O elements as a printer, A/V (audio/video) I/O, 
etc. 

[0061] Computer 701 may include a communication 
interface 720 coupled to bus 71 8. Communication inter- 
face 720 provides a two-way data communication cou- 
pling via a network link 721 to a local network 722. For 
example, if communication interface 720 is an integrat- 
ed services digital network (ISDN) card or a modem, 
communication interface 720 provides a data commu- 
nication connection to the corresponding type of tele- 
phone line, which comprises part of network link 721 . If 



communication interface 720 is a local area network 
(LAN) card, communication interface 720 provides a da- 
ta communication connection via network link 721 to a 
compatible LAN. Wireless links are also possible. In any 
5 such implementation, communication interface 720 
sends and receives electrical, electromagnetic or optical 
signals which carry digital data streams representing 
various types of information. 

[0062] Network link 721 typically provides data com- 
munication through one or more networks to other data 
devices. For example, network link 721 may provide a 
connection through local network 722 to host computer 
723 or to data equipment, operated by ISP 724. ISP 724 
in turn provides data communication services through 
the world wide packet data communication network now 
commonly referred to as the "Internet" 725. Local net- 
work 722 and Internet 725 both use electrical, electro- 
magnetic or optical signals which carry digital data 
streams. The signals through the various networks and 
the signals on network link 721 and through communi- 
cation interface 720, which carry the digital data to and 
from computer 700, are exemplary forms of carrier 
waves transporting the information. 
[0063] Processor 713 may reside wholly on client 
computer 701 or wholly on server 726 or processor 71 3 
may have its computational power distributed between 
computer 701 and server 726. Server 726 symbolically 
is represented in Figure 7 as one unit, but server 726 
can also be distributed between multiple "tiers". In one 
embodiment, server 726 comprises a middle and back 
tier where application logic executes in the middle tier 
and persistent data is obtained in the back tier. In the 
case where processor71 3 resides wholly on server 726, 
the results of the computations performed by processor 
713 are transmitted to computer 701 via Internet 725, 
Internet Service Provider (ISP) 724, local network 722 
and communication interface 720. In this way, computer 
701 is able to display the results of the computation to 
a user in the form of output. 

[0064] Computer 701 includes a video memory 714, 
main memory 71 5 and mass storage 712, all coupled to 
bi-directional system bus 718 along with keyboard 710, 
mouse 711 and processor 713. As with processor 713, 
in various computing environments, main memory 715 
and mass storage 712, can reside wholly on server 726 
or computer 701, or they may be distributed between 
the two. Examples of systems where processor 713, 
main memory 715, and mass storage 712 are distribut- 
ed between computer 701 and server 726 include the 
thin-client computing architecture developed by Sun Mi- 
crosystems, Inc., the palm pilot computing device and 
other personal digital assistants, Internet ready cellular 
phones and other Internet computing devices, and in 
platform independent computing environments, such as 
those which utilize the Java technologies also devel- 
oped by Sun Microsystems, Inc. XML DOM trees and 
identifiers for the nodes in the DOM trees 795 may be 
stored in main memory 715 with a cache 790. Objects 
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removed from the cache may be stored in an area 796 
of mass storage 712. 

[D065] The mass storage 71 2 may include both fixed 
and removable media, such as magnetic, optical or 
magnetic optical storage systems or any other available s 
mass storage technology. Bus 718 may contain, for ex- 
ample, thirty-two address lines for addressing video 
memory 71 4 or main memory 71 5. The system bus 718 
also includes, for example, a 32-bit data bus for trans- 
ferring data between and among the components, such 10 
as processor 71 3, main memory 71 5, video memory 71 4 
and mass storage 712. Alternatively, multiplex data/ad- 
dress lines may be used instead of separate data and 
address lines. 

[0066] In one embodiment of the invention, the proc- 15 
essor 71 3 is a microprocessor manufactured by Motoro- 
la, such as the 680X0 processor or a microprocessor 
manufactured by Intel, such as the 80X86, or Pentium 
processor, or a SPARC microprocessor from Sun Mi- 
crosystems, Inc. However, any other suitable micro- 20 
processor or microcomputer may be utilized. Main 
memory 715 is comprised of dynamic random access 
memory (DRAM). Video memory 714 is a dual-ported 
video random access memory. One port of the video 
memory 71 4 is coupled to video amplifier 71 6. The video 25 
amplifier 71 6 is used to drive the cathode ray tube (CRT) 
raster monitor 717. Video amplifier 716 is well known in 
the art and may be implemented by any suitable appa- 
ratus. This circuitry converts pixel data stored in video 
memory 714 to a raster signal suitable for use by monitor 30 
717. Monitor 717 is a type of monitor suitable for dis- 
playing graphic images. 

[0067] Computer 701 can send messages and re- 
ceive data, including program code, through the network 
(s), network link 721 , and communication interface 720. 55 
In the Internet example, remote server computer 726 
might transmit a requested code for an application pro- 
gram through Internet 725, ISP 724, local network 722 
and communication interface 720. The received code 
may be executed by processor 71 3 as it is received, and/ *o 
or stored in mass storage 71 2, or other non-volatile stor- 
age for later execution. In this manner, computer 700 
may obtain application code in the form of a earner 
wave. Alternatively, remote server computer 726 may 
execute applications using processor 713, and utilize 
mass storage 712, and/or video memory 715. The re- 
sults of the execution at server 726 are then transmitted 
through Internet 725, ISP 724, local network 722 and 
communication interface 720. In this example, computer 
701 performs only input and output functions. so 
[0068] Application code may be embodied in any form 
of computer program product. A computer program 
product comprises a medium configured to store or 
transport computer readable code, or in which computer 
readable code may be embedded. Some examples of 55 
computer program products are CD-ROM disks, ROM 
cards, floppy disks, magnetic tapes, computer hard 
drives, servers on a network, and carrier waves. 



[0069] The computer systems described above are 
for purposes of example only. An embodiment of the in- 
vention may be implemented in any type of computer 
system or programming or processing environment. 
[0070] Thus, a runtime merge system using a refer- 
ence node implementation is described in conjunction 
with one or more specific embodiments. The invention 
is defined by the claims and their full scope of equiva- 
lents. 



Claims 

1. A method for merging one or more hierarchical 
trees at runtime comprising: 

examining one or more nodes in each of said 
hierarchical trees; 

determining if there are equivalent nodes in 
said hierarchical trees; 

picking one or more winning nodes from each 
of said equivalent nodes; and 

storing one or more reference nodes to said 
winning nodes. 

2. The method of daim 1 wherein said trees are doc- 
ument object model (DOM) trees. 

3. The method of at least one of the claims 1 and 2 
further comprising printing a merged tree using said 
reference nodes. 

4. Data processing unit for merging one or more hier- 
archical trees at runtime comprising: 

means for examining one or more nodes in 
each of said hierarchical trees; 

means for determining if there are equivalent 
nodes in said hierarchical trees; 

means for picking one or more winning nodes 
from each of said equivalent nodes; and 

means for storing one or more reference nodes 
to said winning nodes. 

5. The data processing unit of claim 4 wherein said 
trees are document object model (DOM) trees. 

6. The data processing unit of at least one of claims 4 
and 5 further comprising printing a merged tree us- 
ing said reference nodes. 

7. A computer program product comprising: 
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a computer usable medium having computer 
readable program code embodied therein con- 
figured to merge one or more hierarchical trees 
at runtime comprising: 

5 

computer readable code configured to 
cause a computer to examine one or more 
nodes in each of said hierarchical trees; 

computer readable code configured to 10 
cause a computer to determine if there are 
equivalent nodes in said hierarchical trees; 

computer readable code configured to 
cause a computer to pick one or more win- 
ning nodes from each of said equivalent 
nodes; and 

computer readable code configured to 
cause a computer to store one or more ref- 20 
erence nodes to said winning nodes. 

8. The computer program product of claim 7 wherein 
said trees are document object model (DOM) trees. 

25 

9. The computer program product of at least one of 
the claims 7 and 8 further comprising: computer 
readable code configured to cause a computer to 
print a merged tree using said reference nodes. 

30 

10. A program having instructions adapted to make a 
computer carry out the method of at least one of the 
claims 1-3. 

11 . A computer readable medium, in which a program 35 
is embodied, where the program is to make a com- 
puter execute the method of at least one of the 
claims 1-3. 
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